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Abstract 

*  « 

We  present  a  genera!  method  for  translating  sorting  by  coniparisonsjfclgfyrthms  to 
algorithms  that  compute  a  Hamilton  path  in  a  tournament.  The  translation  is  based 
on  the  relation  between  minima)  feedback  sets  and  Hamilton  paths  in  tournaments.  We 
prove  that  there  is  a  one  to  one  corr»'spondenro  between  the  set  of  minimal  feedback  sets 
and  the  set  of  Hamilton  paths.  In  tlie  comparison  model,  ail  the  tradeoffs  for  sorting 
between  the  number  of  processors  and  the  numhor  of  rounds  hold  when  a  Hamilton  path 
is  computed.  For  the  CRCW  model,  with  0(»)  processors,  we  show  the  following:(i) 
Two  paths  in  a  tournament  can  be  merged  in  O(loglogn)  time  (Valiant’s  algorithm 
fVa]);  (ii)  a  Hamilton  path  can  be  computed  in  O(logn)  time  (Cole’s  algorithm).  This 
improves  a  previous  algorithm  for  computing  a  Hamilton  patlij4vhof»e-r«iwi«g  time  wae 
•  0(log^  n)  using  0{ft^)  pweseors.  '  V 
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1  Introduction 


A  tournainont  T  =  {W  D)  (|V'|  =  n)  is  a  directed  graph  in  which  every  pair  of  vertices  is 
joined  by  a  directed  edge.  It  can  be  viewed  as  a  complete  graph  whose  edges  are  given  an 
orientation.  A  Hamilton  path  in  a  graph  is  a  simple  path  that  contains  all  the  vertices,  and 
each  vertex  appears  exactly  once.  A  w'cll  known  theorem  states  that  there  is  a  Hamilton 
path  in  every  tournament  [Rc,  Be]. 

In  this  paper  we  investigate  the  complexity  of  computing  a  Hamilton  path  in  a  tourna¬ 
ment  and  other  problems  related  to  it.  Our  methodology  relies  on  the  intimate  relationship 
that  exists  between  Hamilton  paths  and  minimal  feedback  sets  in  tournaments,  and  their 
connection  to  sorting  algorithms.  Sorting  by  comparisons  may  be  viewed  as  computing  a 
Hamilton  path  in  a  transitive  (acyclic)  tournament.  The  purpose  of  this  paper  is  to  show 
the  opposite  direction,  namely  how  sorting  algorithms  can  be  generalized  to  compute  a 
Hamilton  path  in  an  arbitrary  tournament. 

Sorting  by  comparisons  is  a  well  investigated  problem,  perhaps  the  most  in  computer 
science.  We  siiow  how  to  exploit  the  wealth  of  results  on  it  to  design  parallel  and  sequential 
algorithms  for  arbitrary  tournaments. 

Parallel  algorithms  to  compute  a  Hamilton  path  in  a  tournament  have  appeared  in  [Na, 
So],  but  were  designed  by  ad  hoc  principles.  The  key  idea  in  computing  the  Hamilton  path 
in  both  papers  is  the  following:  in  every  tournament  there  exists  a  vertex  (separator)  whose 
indegree  and  outdogree  are  bounded  from  below  by  lV|/4;  this  gives  a  recursive  formulation 
of  the  problem  with  only  a  logarithmic  number  of  steps.  The  difficulty  with  this  approach  is 
that  the  best  bound  known  for  finding  a  separator  requires  0{v?)  processors.  Ramachandran 
[Ra]  showed  using  adversary  arguments  that  a  lower  bound  on  the  number  of  edges  whose 
orientation  must  be  known  in  order  to  find  a  separator  is  n(n^). 

In  analogy  to  sorting,  we  define  two  additional  problems  on  tournaments: 

♦  Merging  two  paths  such  that  the  relative  order  of  the  vertices  in  the  original  paths 
remains  after  the  merge. 

♦  ifc-selection,  e.g.  finding  the  kih  vertex  in  the  Hamilton  path  in  a  transitive  tourna¬ 
ment,  is  generalized  to  finding  a  vertex  which  is  the  kih  in  some  Hamilton  path  in  an 
arbitrary  tournament. 

We  now  present  a  summary  of  our  method.  A  minimal  edge  feedback  set  f  in  a  directed 
graph  G  -  (V\  D)  is  a  set  of  edges  such  that  G*  =  (V,Z)  -  F)  is  acyclic,  and  F  is  minimal 
with  respect  to  containment.  We  prove  that  there  is  a  one  to  one  correspondence  between 
the  set  of  minimal  feedback  sets  and  the  set  of  Hamilton  paths  in  an  arbitrary  tournament, 
and  show  how  a  minima!  feedback  set  can  generate  a  Hamilton  path  and  vice  versa.  We  also 


show  that  an  edge  in  a  transitive  ton rii anient  whose  orientation  is  known  by  inipHcatioiu 
cannot  appear  in  the  Hamilton  path.  Assume  now  that  the  input  to  a  sorting  algorithm  is 
not  a  transitive  tournament  but  rather  any  arbitrary  tournament.  If  a  minimal  feedback 
is  computed  for  each  round  of  comparisons  and  its  orientation  flipped,  namely  the  sorting 
algorithm  is  “cheated’\  then  the  Hamilton  path  computed  by  the  sorting  algorithm  will  also 
be  a  Hamilton  path  for  the  original  input.  Intuitively,  this  happens  because  the  edges  on 
which  we  ‘‘cheated”  are  actually  implications.  These  notions  are  formalized  and  proved,  and 
they  result  in  a  general  method  for  translating  any  sorting  (or  sorting  related)  algorithm  to 
an  algorithm  that  computes  a  Hamilton  path. 

There  are  two  known  proofs  for  the  existence  of  Hamilton  paths  in  tournaments.  One 
proof  is  Redei's  proof  [Re],  and  the  other  is  the  aforementioned  separator  tlieorcm.  In  view 
of  our  results,  the  first  proof  corresponds  to  insertion-sort,  whereas  the  second  lo  quick-sort. 
In  fact,  our  results  imply  that  the  known  sequential  sorting  algorithms  also  compute  a 
Hamilton  path  in  an  arbitrary  tournament. 

The  equivalence  between  sorting  and  computing  a  Hamilton  patlj  holds  both  in  the 
comparison  model  and  in  PR.AMS.  Valiant’s  comparison  model  [Va]  can  bo  easily  generalized 
to  arbitrary  tournaments.  In  this  model  we  prove  that  the  complexity  of  computing  a 
Hamilton  path  in  an  arbitrary  tournament  and  in  a  transitive  one  is  the  .same.  It  follow.s 
that  all  the  low^cr  bounds,  upper  bounds  and  processor-time  tradeoffs  for  sorting,  apply  also 
when  computing  a  Hamilton  path  in  an  arbitrary  tournament.  These  bounds  and  tradeoffs 
have  been  proved  in  a  series  of  papers,  (AAl,  AA2,  AAV,  AL  AV,  BHe,  BHo,  Pj.  Hell 
and  Rosenfeld  [HR]  have  also  considered  the  sequential  complexity  of  algorithms  in  the 
comparison  model  for  computing  generalized  paths  in  tournaments. 

The  situation  with  implementing  our  translation  method  in  the  PRAM  model  is  more 
complex.  The  difficulty  is  that  it  requires  the  computation  of  a  minimal  feedback  edge  set 
in  a  directed  graph.  It  is  not  known  whether  an  NC  algorithm  exists  for  this  problem. 
We  consider  two  PR.AM  sorting  algorithms,  and  give  for  these  special  cases  a  non  trivial 
procedure  that  computes  a  minimal  feedback  set  in  constant  time.  Tlio  algorithms  are 
Cole’s  merge-sort  [Co]  and  Valiant’s  merging  algorithm  [Va],  and  our  results  are  in  the 
CRCVV  model.  Hence,  a  Hamilton  path  can  be  computed  in  O(logn)  time,  and  two  paths 
of  length  n  can  be  merged  in  O(loglogn)  time;  both  algorithms  use  0(n)  processors. 

These  two  algorithms  achieve  an  optimal  speed  (up  to  a  constant  factor)  with  respect 
to  the  sequential  complexity.  Notice  that  in  our  case  the  number  of  processors  is  linear  in 
the  number  of  vertices  and  not  edges.  This  result  is  interesting  by  itself. 

Our  results  also  imply  an  0(n  log  n)  sequential  algorithm  for  computing  a  Hamilton  path 
in  an  arbitrary  tournament.  As  we  already  have  mentioned,  merge-sort  will  also  output  a 
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Hamilton  path  for  an  arbitrary  tournameiil.  Tins  Iniund  also  follows  from  Red^i's  proof  (Rej, 
but  only  when  appropriate  data  structures  are  used.  It  ran  be  proved  that  Batcher's  sorting 
network  [Ba],  also  computes  a  Hamilton  path  when  the  injnit  is  an  arbitrary  tournament. 
However*  it  is  not  obvious  whether  tlm  AKS  sorting  network  [AKS]  can  be  adapted  to 
tournaments.  This  motivates  the  search  for  a  sorting  network  whose  depth  is  less  tlian 
0{log^n)  and  also  computes  Hamilton  paths.  It  would  also  imply  better  bounds  for  EREW 
algorithms. 

As  for  the  Jt-selection  problem,  it  is  ea*sy  to  show  a  lower  bound  of  ^(log  n/loglogn) 
time  for  any  PRAM  model  even  in  the  case  of  transitive  tournaments.  Hence,  the  best 
strategy  (up  to  a  factor  of  O(loglogn)  would  l)e  first  to  compute  a  Hamilton  path,  and 
then  solve  the  ifc-selection  problem.  On  the  other  hand,  in  the  comparison  model,  the  result 
of  (AKSS)  implies  an  O(loglogn)  upper  bound. 

2  Terminology  and  preliminaries 

Let  r  =  (V^£)  denote  a  tournament,  that  is  a  complete  graph  whose  edges  are  oriented. 
Let  the  cardinality  of  the  vertex  set  be  denoted  !\v  ».  If  an  edge  is  oriented  from  r  to  tr, 
then  we  say  that  v  is  smaller  than  xv  (w  is  greater  than  r)  and  denote  it  either  by  v  <  xt\ 
or  by  r  u;.  A  path  vq,  Vi, . . Vjt  is  a  secpicnce  of  vertices  where  r,  <  and  r,  ^  Vj  for 
0  <  i  ^  j  <  A:.  A  cycle  is  a  path  where  Vo  =  If  vortex  r  precedes  vertex  ir  in  a  path  P, 
then  V  <  XV  with  respect  to  the  path  P  or  r  is  below  w  (w  is  above  n).  The  first  and  last 
vertices  in  the  path  are  called  bottom  and  top  respectively. 

A  tournament  is  transitive  if  and  only  if  it  is  acyclic.  An  implication  in  a  transitive 
tournament  is  an  edge  whose  orientation  is  implied  by  the  orientation  of  other  edges  in  the 
tournament,  namely  to  avoid  cycles. 

Let  P  and  Q  be  two  paths  in  an  arbitrary  tournament  and  let  R  be  the  result  of  merging 
P  and  Q.  Then  this  merge  has  the  following  property:  if  r  <  tr  with  respect  to  P  (Q),  then 
also  V  <  xr  with  respect  to  R. 

An  edge  feedback  set  F  in  a  directed  graph  C$  -  (V*  D)  is  a  set  of  edges  such  that 

=  {\\D-  F)  is  acyclic.  Computing  such  a  set  of  minimum  cardinality  is  NP-coniplete 
[GJ],  whereas  computing  a  minimal  such  set  with  respect  to  containment  can  be  easily  done 
in  polynomial  time  by  a  greedy  algorithm,  .\n  easy  property  of  minimal  feedback  sets  is 
that  the  graph  resulting  from  inserting  any  edge  <»f  F  in  G'  is  cyclic. 

The  graph  G  =  is  called  a  complete  path  directed  bipartite  graph  (abbreviated 

CPB)  if  Q  contains  all  the  edges  between  V’  and  IT,  and  the  graph  induced  by  V  (H^)  is  a 
directed  Hamilton  path. 


3  Hamilton  paths,  minimal  feedback  sets  and  the  compar¬ 
ison  model 

In  this  sortion  we  show  how  Hamilton  paths  and  minimal  feedback  sets  are  related  to  each 
other  in  tournaments.  VVe  first  extend  the  parallel  comparison  model  to  tournaments.  Tins 
model  was  first  introduced  by  Valiant  [Va]  for  the  purpose  of  analyzing  sorting  algorithms: 
only  comparisons  are  taken  into  account  in  it,  wherea^s  internal  processor  computation  and 
comnniniration  are  not  charged  for.  The  structure  of  an  algorithm  in  this  model  is  the 
following:  in  each  round  a  sot  of  element  pairs  are  compared  until  the  output  is  known. 
The  aim  of  an  algorithm  is  to  minimize  both  the  number  of  rounds  and  the  total  number 
of  comparisons. 

We  extend  this  model  for  tournaments  in  the  natural  way.  The  answer  to  a  comparison  is 
the  orientation  of  an  edge  in  the  tournament.  Hence,  in  the  beginning  we  have  a  tournament 
whose  edge  orientation  is  unknown,  and  at  each  round,  we  a^k  for  the  orientation  of  a  set 
of  edges.  The  algorithm  proceeds  till  the  induced  graph  of  the  known  edges  contains  the 
solution.  Let  /(;>.  A*)  denote  the  minimum  number  of  comparisons  needed  to  compute  a 
function  /  in  A  rounds  and  p  processors.  As  we  already  have  mentioned,  sorting  can  be 
viewed  as  computing  a  Hamilton  path  in  a  transitive  tournament. 

The  next  easy  lemma  is  used  by  our  main  theorem  that  follows  immediately. 

Lemma  3.1  Let  r  =  —  u  be  an  edge  in  a  transitive  tournament  whose  direction  is  known 

by  implication.  Then  e  cannot  appear  on  the  Hamilton  path. 

Proof;  If  the  orientation  of  e  is  known  by  implication  then  there  exists  an  element  w  such 
that  u  — ‘  w  and  xv  —  v.  Hence,  e  cannot  appear  on  the  Hamilton  path.  | 

Theorem  3.1  Let  A  be  an  algorithm  that  computes  a  Hamilton  path  in  a  transitive  tournament 
with  complexity  H{p.k).  Then  there  exists  an  algorithm  f?  that  computes  a  Hamilton  path  in 
a  non  transitive  tournament  T  with  the  same  complexity  Il(p.k). 

Proof:  Let  , . . . ,  Qi;  be  disjoint  sets  of  directed  edges  and  let  F  =  Fi  U  fb  U  •  •  •  U  be 
a  set  with  the  following  properties: 

1.  Fi  C  Q.  . 

2.  F\  is  a  minimal  feedback  set  in  the  graph  induced  by  Qi. 

3.  Inductively,  is  a  minimal  feedback  set  in  the  graph  induced  by  {Qi  -  Ti)  U  {Qi  - 
F2)U 

It  is  easy  to  verify  that  for  every  i,  P  =  Fi  U  •  •  •  U  is  a  minimal  feedback  in  the  graph 
induced  by  U  •  •  •  U  Q,. 
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Doiiotc  !)>•  -’Fi  tin*  set  of  edges  of  F  such  that  the  orientation  of  each  edge  in  Fi  is 
flipped  and  let  Q',  =  Q,-  F,U  if,.  Define  A(Ri,...,Rt)  to  be  the  set  of  comparisons  in 
round  /  +  1  of  algorithm  A  under  the  assumption  that  the  comparisons  in  the  first  i  rounds 

were  R\ . R,.  With  these  notations,  the  set  of  comparisons  of  .\lgorithm  B,  Qi,.  .■,Qk^ 

will  be  derived  from  algorithm  A  in  the  following  way; 

1.  Qi  -^(0) 

2.  Q.+i 

First  we  show  that  Algorithm  B  is  well  defined  by  proving  that  /f,  =  (?',  U  •  •  •  U  <?■_,  is  a 
legal  input  to  round  i  of  algorithm  A.  It  is  easy  to  verify  that  H,  is  a  legal  input  if  it  is 
acyclic.  .Assume  to  the  contrary  that  there  is  a  cycle  c  in  it.  The  edges  of  Hi  are  of  two 
types:  edges  of  the  -.ninimal  feedback  set  f‘  that  were  flipped,  or  unflipped  edges.  By  the 
definition  of  a  minimal  feedback  set,  for  every  edge  e  =  (u  -*  t>)  €  f,  there  is  a  path  p*  of 
unflipped  edges  from  t-  to  it.  Now,  exchange  every  edge  e  €  c  that  belongs  to  f  *  by  pg  and 
get  as  a  result  a  cycle  of  edges  that  were  not  flipped.  If  that  cycle  is  not  simple,  it  contains 
a  simple  cycle  as  a  subgraph,  hence  contradicting  f‘  being  a  minimal  feedback  set. 

Now  that  .Algorithm  B  is  well  defined,  assume  to  the  contrary  that  its  output  p  is  not 
a  Hamilton  path  in  T.  The  path  p  is  not  a  Hamilton  path  only  if  it  contains  edges  whose 
orientation  wa.s  flipped,  namely  edges  that  belong  to  f,  the  minimal  feedback  set.  This 
cannot  happen  a.s  the  edges  of  F  are  implications  in  the  graph  induced  by  Hk,  and  therefore 
cannot  apjiear  on  the  Hamilton  path  by  the  lemma  3.1.  | 

Corollary  3.1  Let  A  be  an  algorithm  that  merges  two  paths  in  a  transitive  tournament  with 
complexity  A/(p,  /;)•  Then  there  exists  an  algorithm  B  that  merges  two  paths  in  a  non  transitive 
tournament  with  the  same  complexity.  | 

Another  problem  that  was  extensively  studied  in  transitive  tournaments  is  the  L-selection 
problem:  find  an  element  larger  than  k  -  I  elements  and  smaller  than  n  -  k  elements.  This 
problem  can  be  generalized  to  non  transitive  tournaments  in  two  ways.  The  first  one  is 
to  find  an  element  in  the  tournament  (if  one  exists)  whose  indegree  is  exactly  equal  to  k. 
It  can  be  shown  that  the  minimum  number  of  comparisons  needed  to  determine  such  an 
element  is  Qfii^)  [Ra],  and  therefore  the  running  time  cannot  be  polylogarithmic  with  a 
linear  number  of  processors. 

■A  more  rela.xed  definition  is  to  find  an  element  v  in  the  tournament  such  that  v  is  the 
k-lh  element  in  some  Hamilton  path.  We  show  that  the  complexity  of  this  problem  is 
equivalent  to  the  complexity  of  fc-selection  in  the  transitive  case. 
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Corollary  3.2  Let  A  be  an  algorithm  that  solves  the  A:-selection  problem  in  transitive  tour¬ 
naments  with  complexity  S(p.k).  Then  there  exists  an  algorithm  B  that  solves  the  it-selection 
problem  in  non  transitive  tournaments  with  the  same  complexity. 

Proof:  Algorithm  A  is  translated  into  B  in  th^'  same  way  as  in  Theorem  3.h  The  corollary 
follows  from  the  observation  that  any  A'-selection  algorithm  can  be  viewed  as  a  step  in  a 
sorting  algorithm.  B 

It  follows  from  Corollary  3.2  that  the  ^.'-selection  algorithm  of  [AKSS]  in  the  comparison 
model  whose  time  complexity  is  0(logIog7?)  with  a  linear  number  of  processors,  can  be 
applied  to  arbitrary  tournaments  as  well.  On  the  other  hand,  in  the  PRAM  model,  we  have 
the  following  theorem  that  implies  that  the  best  strategy  for  Ar-selection,  (up  to  a  factor  of 
O(loglogn)),  would  be  first  to  compute  a  Hamilton  path. 

Theorem  3.2  There  is  a  lower  bound  of  fi(log7?y  loglog  7?)  on  the  complexity  of  the  fc- 
selection  problem  for  transitive  tournaments  in  the  PRAM  model. 

Proof:  This  follows  easily  from  the  lower  bound  of  [Bea]  on  computing  the  exclusive  OR  of 
a  bit  vector,  as  it  also  implies  a  lower  bound  of  fi{log7//  log  log  n)  on  sorting.  Assume  there 
was  a  ^-selection  algorithm  whose  time  complexity  was  better  than  0(Iog77).  Invoking  this 
algorithm  n  times  simultaneously  for  k  =  1,2,  — ,7?  would  imply  a  better  bound  for  sorting. 
Hence,  a  contradiction.  | 

Another  consequence  of  Theorem  3.1  is  the  following  relation  between  Hamilton  paths 
and  minimal  feedback  sets. 

Theorem  3.3  Let  V  be  the  set  of  all  Hamilton  paths  In  a  tournament  T  and  let  T  be  the  set 
of  all  minimal  feedback  sets  in  a  tournament  T.  There  is  a  one  to  one  correspondence  between 
V  and  T, 

Proof:  We  first  show  how  a  minimal  feedback  set  F  can  be  computed  from  a  given  Hamilton 
path  p.  For  every  pair  of  vertices  r,  tn  such  that  v  precedes  tu  in  the  path,  add  the  edge 
(r,  m)  to  F  if  it  is  oriented  from  w  to  v.  Obviously,  F  is  a  minimal  feedback  set  and  two 
different  paths  cannot  generate  the  same  minimal  feedback  set.  We  now  prove  the  other 
direction.  Let  F  be  a  given  minimal  feedback  set  in  a  tournament  T  and  let  T'  be  the 
tournament  in  which  the  orientation  of  the  edges  in  F  were  flipped.  By  Theorem  3,1,  a 
Hamilton  path  in  V  is  also  a  Hamilton  path  in  T.  let  F]  and  F2  be  two  minimal  feedback 
sets  and  let  t;  — >  t£;  be  an  edge  in  Fj  and  not  in  F2.  Fi  and  F2  cannot  generate  the  same 
path  as  T{  will  contain  r  lu  and  FJ  will  contain  v  -r  w.  | 

It  follows  from  Theorem  3.3  that  all  the  results  on  the  cardinality  of  V  [Mo]  apply  to 
T  as  well.  A  criterion  to  decide  whether  there  is  a  path  in  a  tournament  that  starts  at  a 
given  vertex  t;  and  ends  at  some  other  given  vertex  w  also  follows  from  Theorem  3.3.  Let 
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in,,  (out,.)  be  the  set  of  incoming  {oiitroming)  edges  into  v  (out  of  v). 

Corollary  3.3  A  necessary  and  sufficient  condition  fo^  the  existence  of  a  Hamilton  path  from 
i>  to  u’  is  the  existence  of  a  minimal  feedback  set  containing  both  in^  and  out,,, 

Proof:  Assume  that  such  a  inini:nal  feedback  set  exists.  After  its  edges  are  flipped,  v 
becomes  a  source  and  w  a  sink  and  hence,  according  to  Theorem  3.1,  there  is  a  Hamilton 
path  from  v  to  w.  If  a  Hamilton  path  from  v  to  tr  exists,  then  tTi„  and  outu,  belong  to  its 
corresponding  minimal  feedback  set.  (Same  con.struction  as  in  Theorem  3.2).  | 

4  The  CRCW  model 

In  this  section  we  exhibit  algorithms  for  sorting  and  merging  in  an  arbitrary  tournament 
that  are  based  on  Theorem  3.1.  The  complexity  of  these  algorithms  matches  the  complexity 
of  the  corresponding  algorithms  in  transitive  tournaments. 

In  the  first  subsection  we  present  Algorithm  MFS  that  in  0(1)  time  finds  a  minimal 
feedback  set  in  a  CPB  graph  G  -  (P,  IT,£).  In  the  other  two  subsections,  we  show  how  to 
translate  certain  algorithms  for  sorting  and  merging  in  transitive  tournanir  nts,  to  algorithms 
in  arbitrary  tournaments  by  calling  MFS  as  a  subroutine.  It  turns  out  that  computing  a 
minimal  feedback  set  in  our  special  crses,  reduces  to  computing  it  in  a  CPB  graph.  Wo 
shall  prove  that  in  each  case,  iho  feedback  set  computed  by  repeated  calls  to  .Algorithm 
MFS  is  indeed  the  minimal  feedback  set  required  by  Theorem  3.1.  The  algorithms  cho.scn 
are  (they  both  use  0[n)  proce.ssors): 

Merging:  Valiant’s  merging  algorithm  [Va]  with  O(loglogn)  time  complexity. 

Sorting:  Merge-Sort  [Co]  with  O(logn)  time  complexity. 

4.1  Finding  the  minimal  feedback  set 

Algorithm  MFS  invokes  the  two  procedures  M.AXINDEX  and  MAXPREFIX.  We  now 
present  them  and  prove  that  each  can  be  implemented  in  0(1)  time. 

Procedure  MAXINDEX(.-l): 

input:  A  binary  sequence  .d  =  «] - ,n(;  I  processors. 

output:  The  maximum  index  k.  \  <k  <  L  such  that  n*  =  1. 


Lemma  4.1  MAXINDEX  can  be  implemented  m  the  CRCW  model  in  0(1)  time 

Proof:  It  is  well  known  tliat  coni]Mitiii;-;  t!io  OR  ( respect ively,  AND)  of  a  binary  .sequence 
of  l  .'iigtli  (  with  I  processors  takes  0(1)  time  in  tlie  CRCW  model. 

Urst  we  sliow  liow  to  compute  MAXINDEX  in  0(1)  time  witli  processors.  Each 
element  of  A  such  that  =  1  computes  b,  =  max,<j<rn(j)  in  0(1)  time.  (This  is  possible 
as  each  a,  has  C  urocessors  available).  If  =  1,  then  there  exists  an  index  j  >  i  such  that 
aj  =  1,  and  therefore  i  cannot  be  the  desired  answer.  In  this  case  we  set  a,  =  0  and  clearly, 
only  one  value  remains  equal  to  1.  The  index  of  this  value  is  the  answer. 

Now  we  show  how  to  compute  M.4XINDEX  in  0(1)  time  in  the  CRCW  model  with 
only  f  processors.  (For  clarity,  assume  that  \/f  is  an  integer.) 

1.  Divide  the  t  elements  of  the  sequence  into  y/C  blocks  where  each  contains  \/f  consec¬ 
utive  items. 

2.  For  1  <  !  <  \/f,  let  c,  denote  the  OR  of  the  »-th  block. 

3.  Let  k  be  the  output  of  Procedure  MAXINDEX  when  the  input  is  the  vector  Cj. . . . ,  c 
This  takes  0(  1 )  time  as  there  are  C  processors  and  the  vector  is  of  length  y/C. 

1.  Let  k*  be  the  output  of  Procedure  MAXINDEX  when  the  input  is  the  fc-th  block. 
This  takes  0(1)  time  as  there  are  f  processors  and  the  block  is  of  length  v/f. 

•5.  The  output  is  k'. 

I 

Procedure  MAXPREFIX(A,  J3): 

input:  A  sequence  A  =  oj, . . .  ,ar  of  integers;  processors. 

output:  A  sequence  J3  =  of  integers  such  that  for  every  i,  1  <  »  <  f, 

6,  =  max  {nt). 

i<t<i 

Lemma  4.2  MAXPREFIX  can  be  implemented  in  the  CRCW  model  in  0(1)  time. 

Proof:  First  construct  in  0(1)  time  a  matrix  M  =  wliere  nnj  =  1  iff 

«,•  >  ttj,  or  i  =  j.  Each  row  i  calls  a  variation  of  procedure  MAXINDEX  and  computes 
in  0(1)  time:  the  minimum  index  j  in  the  subrow  such  that  njjj  =  0  and 

mu+i  =  •  •  •  =  m,j_i  =  1.  For  every  k,  l<k<i  or  j<k<£  set  m,-.t  =  0.  If  now 
=  1>  then  a,  >  aj  and  for  each  k,  i  <  k  <  j,  it  is  also  known  that  «,  >  a*. 

Therefore,  it  now  remains  for  each  aj  to  find  the  minimal  index  i  such  that  nnj  =  1. 
Again,  this  can  be  done  in  time  0(1)  by  calling  a  variation  of  procedure  MAXINDEX  for 
each  column  of  the  matrix  M.  | 


Figure  1:  The  three  types  of  cycles. 


Wo  are  now  ready  to  present  Algorithm  MFS. 

Algorithm  MFS: 

Input:  A  complete  path  bipartite  graph  G  =  (V^ir,Q);  V  =  =  wj, . . . 

0{r^  +  rs)  processors. 

Output:  A  minimal  feedback  set  F  in  C?. 

For  all  1  <  t  <  r  : 

1.  Define  a(i)  as  follows: 

(a)  If  Vi  >  tV3,  then  a{i)  =  .s: 

(b)  else,  if  Vj  <  tvt,  then  //(i)  =  0; 

(c)  otherwise,  let  a{i)  be  the  maximum  index  j  such  that  Wj  <  Vi  < 

2.  Let  5(t)  be  the  maximum  value  among  {a(l a(*)}. 

3.  For  all  j,  1  <  j  <  b{i):  if  Vi  <  Wj  then  add  the  edge  r,  tvj  to  F. 


Lemma  4.3  The  set  F  computed  by  Algorithm  MFS  is  a  minimal  feedback  set  in  Q. 

Proof:  F  is  a  minini.il  feedback  .set  \{  ~  Q  -  F  h  acyclic  and  if  adding  an  edge  of  F  to 

Q'  generates  a  cycle. 

Assume  to  the  contrary  that  there  is  a  cycle  in  Q'  and  suppose  that  the  cycle  is  one  of 
the  following  three  types: 

1.  (Figure  la); 

2.  (Figure  lb); 

3.  Vi,  Wj, . . . ,  Wj*,  r,s  . . . ,  r,  (Figure  Ic). 

We  now  show  that  in  the  three  above  cases  b{i)  >  j  and  this  leads  us  to  a  contradiction, 
because  in  Step  3  of  algorithm  MFS,  edge  v,  — ►  wj  €  JP. 

1.  The  edge  tUj  tp  is  in  Q;  therefore  the  maximality  of  a(f')  implies  that  «(i')  >  j 
and  consequently,  b(i)  >  6(F)  >  a(F)  >  j. 

2.  The  edge  xvy  — ►  r,  is  in  Q:  therefore  a(x)  >  f  which  implies  that  6(*)  >  f  >  j. 

3.  The  edge  xvy  is  in  Q:  therefore  b{i)  >  6{F)  >  a(F)  >  y  >  j. 

Now  let  c  be  an  arbitrary  cycle  and  let  Vi  be  its  highest  vertex  in  V.  The  successor  of 
in  c  belongs  to  W,  and  denote  it  by  tUj.  W^e  show  that  there  are  three  possible  cases,  and 
each  implies  the  existence  of  one  of  the  above  three  types  of  cycles. 

If  XV j  is  the  highest  vertex  of  c  in  W  then  its  successor  in  c  is  vp  €  V.  It  follows  that 
F  <  i  (from  v,  being  the  highest)  and  hence  vp,. .  is  a  cycle  in  Q'  of  type  (1). 

Otherwise,  let  xvj*  be  the  highest  vertex  of  c  in  VF  for  some  f  >  j.  Denote  the  successor  of 
xvjt  in  c  by  Up  €  for  F  <  t.  There  are  two  cases.  If  F  =  t  then  in  Q',  r,,  tCj, . . . ,  xvy,  Vj  is 
a  cycle  of  type  (2).  Else,  F  <  t  and  ..,r,  is  a  cycle  in  of  type  (3). 

To  complete  the  proof,  we  have  to  show  that  if  an  edge  Vi  xvj  €  F  is  added  to  Q\ 
then  a  cycle  is  generated.  This  follows  from  the  following  two  facts: 

Fact  1:  The  edge  Vi  is  always  an  edge  in  Q'  as  long  as  a{i)  >  0. 

Fact  2:  if  Vi  wj  €  F  then  j  <  6(t). 

If  J  <  a(i),  then  is  a  cycle  (type  (2)).  Otherwise,  by  the  above  two 

facts  a{i)  <  j  <  b{i).  If  j  =  6{i),  then  there  exists  F  <  t  such  that  a(F)  =  6(t).  By  Fact  1, 
^6(i)  €  (?'  and  v,,  trj,  rp, . .  .,ri  is  a  cycle  (type  (1)).  Else,  j  <  6(t)  and  again,  there 

exists  F  <  t  such  that  a{F)  =  b{i)  >  j  and  by  Fact  1  xvf,^i)  vp  €  It  follows  that 
ViyWjy . .  M  .  •  * ,  IS  a  cycle  (type  (3)).  | 
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Leninin  4.4  Algorithm  MFS  can  be  computed  in  time  ()(  1 ). 

Proof:  It  is  easy  lo  see  how  stop  Ic  can  be  coinpiitod  with  the  h^'lp  of  protodiire  MAXIN- 
DFX  witli  r,s  processors.  Hence,  it  takes  time  0(1)  by  Lemma  4.2.  Step  2  calls  Proceduie 
MAXPREFIX  with  processors  and  can  also  be  computed  in  time  0(1)  by  Lemma  4.3. 
Obviously,  the  rest  of  the  algorithm  can  be  computed  in  time  0(1).  I 

4.2  Merging 

In  this  subsection  we  show  how  Valiant's  [Va]  merging  algorithm  can  be  modified  to  a 
CRCW  algorithm  that  merges  two  paths  in  a  tournament,  VVe  elaborate  on  its  description 
for  two  reasons:  (i)  to  simplify  the  proof  of  Theorem  4.1;  (ii)  to  the  best  of  our  knowledge, 
a  detailed  description  of  Valiant’s  algorithm  in  the  PRAM  model  does  noi  exist.  The 
algorithm  uses  a  linear  number  of  processors  (linear  in  the  length  of  both  path.s)  and  its 
time  complexity  remains  0(loglog  n)  (n  is  the  length  of  A‘,  the  longer  path).  Borodin  and 
Hoperoft  [BH]  proved  a  lower  bound  of  ft(loglog  n)  on  the  complexity  of  merging  two  paths 
and  hence,  our  results  are  tight.  Hereafter,  denote  by  X  =  and  Y  = 

the  two  paths  to  be  merged.  VVe  omit  ceiling  and  floor  for  clarity. 

Valiant's  algorithm  employs  a  divide-and-conquer  method:  in  (9(1)  time,  merging  two 
paths  is  divided  into  merging  many  pairs  of  subpaths  from  X  and  V',  where  in  each  pair, 
the  length  of  the  subpath  belonging  to  X  is  y/n.  Then,  in  0(1)  time,  the  paths  produced 
by  merging  the  pairs  are  concatenated  to  output  tlie  desired  merge.  The  basic  step  of  the 
recursion  is  when  n  is  a  constant  and  then,  the  merge  is  completed  by  performing  all  possible 
comparisons.  The  time  complexity  0(loglogT?)  is  the  solution  of  the  recursive  equation: 

♦  T(a)  =  0(l)  ;  a  =  0(1)  ; 

•  r(n)  =  r(vAr)  +  o(i). 

We  now  explain  how  the  divide-and-conquer  is  achieved.  Each  of  the  two  paths,  X  and 
V,  is  divided  into  subpaths  whose  lengths  are  a  square  root  of  their  original  length.  The 
top  of  every  subpath  is  calletl  its  leader,  and  denote  by  LX  and  LY  the  set  of  leaders  in  A' 
and  Y  respectively.  We  may  sometimes  refer  to  LX  and  LY  also  as  paths. 

The  first  step  of  comparisons  in  the  algorithm  is  between  all  the  leaders  of  LX  with  all 
the  leaders  of  LY,  (This  can  be  done  in  one  step  as  y/iiw  <  n  -|-  m.)  At  this  point,  for  each 
leader  y  €  IV,  there  are  two  successive  leaders  x*  <  x  £  lA',  such  that  y  >  x*  and  y  <  x. 
This  pair  of  leaders  is  uniquely  defined  in  the  case  of  a  transitive  tournament  and  can  be 
computed  in  lime  0(1).  The  exact  place  in  A'  where  y  is  eventually  inserted,  will  be  called 
the  insertion  point  of  y.  Now,  for  two  consecutive  leaders  in  IVX  y*  <  y^  there  is  a  subpath 
LX{y)  in  LX  and  a  snbpatli  V(y)  in  such  that  each  vertex  in  these  snbpaths  is  smaller 


Figure  2:  Proof  of  Theorem  4.1 


than  y  and  greater  than  y'  (Figure  2a). 

The  second  step  of  comparisons  in  the  algorithm  is  performed  between  all  the  vertices 
in  LX{y)  and  Y(y)  for  each  leader  y  €  IX\  After  this  step,  the  insertion  point  in  Y  of  each 
leader  x  €  LX  is  known,  namely  there  are  two  successive  vertices  in  F,  ]f  <  V,  such  that 
X  >  r/  and  x  <  y.  Again,  this  pair  of  vertices  is  uniquely  defined  in  the  case  of  a  transitive 
tournament  and  can  be  computed  in  time  0(1).  Now  for  two  consecutive  leaders  in  LX^ 
X*  <  X,  there  is  a  subpath  A’(x)  in  A’  and  a  subpath  Y(x)  in  T,  such  that  each  vertex  in 
these  subpaths  is  smaller  than  x  and  greater  than  x'  (Figure  2b). 

Now  the  subpaths  can  be  merged  simultaneously.  For  every  leader  x  £  LX ^  the  algo¬ 
rithm  merges  recursively  the  paths  A'’(x)  and  VT^)  into  the  path  Z{x).  After  the  recursion 
is  completed,  x  (x')  can  be  inserted  above  (below)  Z(x). 

Thus,  by  merging  y/n  pairs  of  subpaths  and  then  concatenating  them  through  the  leaders 
of  A,  the  desired  merge  is  achieved. 

The  comparisons  steps  that  precede  the  recursive  calls  in  Valiant’s  algorithm  incur  two 
problems  in  an  arbitrary  tournament.  A  leader  y  €  LY  (x  €  LX)  may  have  several  insertion 
points  in  LX  (F);  furthermore,  if  y'  <  y  (x'  <  x)  then  the  insertion  point  of  /  (xO  is  not 
necessarily  smaller  than  that  of  y  (x). 

To  overcome  these  problems,  we  compute  a  minimal  feedback  set  in  the  graph  induced 
by  the  comparisons,  as  suggested  by  Theorem  3.1.  Whenever  a  set  of  vertices  is  compared, 
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a  Tuitiiiual  fecdUack  sot  is  computed  and  the  orientation  of  its  ed,fz;cs  is  flipped.  In  all  cases, 
the  minimal  feedback  set  is  either  the  output  of  Algorithm  MFS,  or  the  union  of  outputs 
of  differenl  calls  to  Algorithm  MFS.  There  arc  four  c  ases  summarized  below: 

1)  The  base  case  of  the  recursion:  Here,  n  is  a  constant,  and  the  minimal  feedback  set 

is  the  output  of  Algorithm  MFS  computed  on  the  whole  graph. 

2)  The  first  round  of  comparisons  before  the  recursive  calls:  Here,  the  leaders  of 

LX  and  LY  are  compared  amd  again,  the  minimal  feedback  set  is  the  output  of 
Algorithm  MFS. 

3)  The  second  round  of  comparisons  before  the  recursive  calls:  The  graph  induced 

by  these  comparisons  is  a  union  of  CPB  graphs  and  Algorithm  MFS  is  invoked  for 
each  CPB  graph.  The  minimal  feedback  set  is  the  union  of  the  minimal  feedback  .sets 
computed  for  each  CPB  graph. 

4)  The  rounds  during  the  recursive  calls:  The  structure  of  the  comparison  graph  is 

the  same  as  in  case  (3)  and  the  minimal  feedback  set  is  the  union  of  the  minimal 
feedback  sets  computed  for  each  CPB  graph. 

Theorem  4-1  The  modification  of  Valiant’s  merge  algorithm  according  to  Theorem  3.1  merges 
two  paths  in  an  arbitrary  tournament. 

Proof:  We  prove  it  by  induction  on  fc,  the  depth  of  the  recursion.  When  1:  =  1  (case  1), 
namely  the  length  of  one  of  the  merged  paths  is  a  con.stant.  the  correctness  follows  as 
Algorithm  MFS  computes  a  minimal  feedback  set  (Lemma  4.3).  Assume  that  the  theorem 
is  true  when  the  recursion  depth  is  less  than  k, 

III  the  first  stop  of  comparisons  (case  2),  the  leaders  of  X  and  1  are  compared.  It 
follows  again  from  lemma  4.3  that  a  minimal  feedback  set  is  indeed  computed.  Hence,  the 
picture  depicted  by  Figure  2a  is  valid.  Now  assume  that  the  second  round  of  comparisons 
before  the  recursive  calls  took  place  (case  3)  and  assume  also  that  the  union  of  minimal 
feedback  sets  computed  for  edge  disjoint  subgraphs  is  not  a  feedliack  set.  To  disprove  this 
last  assumption,  it  is  enough  to  show  for  every  cycle  that  all  of  its  c'dges  belong  to  the  same 
subgraph.  We  prove  the  following  claim: 

Claim:  For  each  leader  y  €  LYy  if  a  vertex  from  LX{y)  (or  Y[y))  takes  part  in  a  cycle, 
then  that  cycle  is  completely  contained  in  iA  (y)uy(y). 

Proof  of  the  claim:  Construct  the  following  graph  //:  for  each  leader  y  €  LY  associate  a 
vertex  /(y)  in  //  which  is  the  contraction  of  LX{y)  and  y(y)  e.g.,  a  vertex  in  H  can  be 
viewed  as  a  set  of  “old’’  vertices.  I^et  o  and  6  be  two  vertices  in  H;  the  edge  0-^6  exists  if 
before  the  contraction,  there  was  an  edge  oriented  from  a  vertex  in  the  set  a  to  a  vertex  in 
the  set  6. 


Notice  that  the  leader  y  is  the  only  vertex  in  f{y)  that  was  compared  to  vertices  in 
LX{y')  for  any  leader  y'  <  y.  The  wav  the  sef!;nients  in  LX  were  chosen  implies  that  y  is 
greater  than  ail  the  vertices  in  LX(y^)  and  clearly  y  is  greater  than  all  the  verticeN  ifi 
Hence,  H  is  well  defined  and  is  isomorphic  to  a  path.  The  correctness  of  the  claim  follows 
imniedia*  Ay,  | 

In  the  rounds  during  the  recursion  (case  4),  similar  arguments  to  the  above  hold.  The 
analogous  claim  is  that  for  each  leader  t  G  LX ^  if  a  vertex  from  A'(.t)  (or  1’(j))  takes  part 
in  a  cycle,  then  that  cycle  is  completely  contained  in  LX{y)UY(y).  Therefore,  it  is  enough 
to  compute  a  minimal  feedback  set  in  each  CPB  graph.  | 

Theorem  4.2  The  time  complexity  of  the  modified  algorithm  is  O(loglog»)  when  0(«)  pro¬ 
cessors  are  available. 

Proof:  Define  r(n,m)  to  be  the  time  complexity  of  the  modified  algorithm.  By  Lemma 
4.4,  Algorithm  MFS  can  be  implemented  in  time  0(1)  if  the  number  of  available  processors 
is  0(r^  -h  r^),  where  r  and  s  denote  the  lengths  of  the  input  paths.  We  need  to  show  that 
whenever  Algorithm  MFS  is  invoked  there  is  a  sufficient  number  of  processors  available. 

If  u  is  a  constant,  the  claim  follows  from  the  fact  that  0(rt  4  m)  =  0{nm),  When  LX 
and  LY  are  compared,  the  claim  follows  from  the  inequality  y/nrn  <  ??  4-  m. 

If  ^  Therefore,  there  are  enough  processors  for  the  second 

step  of  comparisons  that  precedes  the  recursive  call.  Each  subpath  y(y)  from  V  that  is 
merged  with  o  leaders  from  LX,  receives  0(q^  4-  Q^^m)  processors. 

For  the  simultaneous  recursive  calls  there  are  enough  processors,  as  each  submerge  of 
two  paths  of  length  n'  and  m'  gets  0(7)'  4-  processors. 

Clearly,  the  rest  of  the  algorithm  can  be  implemented  in  constant  time  same  as  the 
transitive  case.  Hence  the  recursive  equation  is: 

T(n,  m)  =  T{y/n,  m')  4*  c, 

for  some  constant  c.  The  solution  of  the  above  equality  is  O(loglogn).  | 

4.3  Sorting 

In  this  subsection  we  show  how  a  path  in  a  tournament  can  be  found  in  the  CRCW  model 
in  O(logn)  time  using  0(n)  processors.  We  rely  on  Cole’s  merge-sort  [Co]  algorithm  and 
compute  a  minimal  feedback  set  in  0(1)  time  for  each  round  of  comparisons.  We  shdH 
give  only  an  outline  of  Cole’s  algorithm,  and  elaborate  on  the  steps  where  comparisons  are 
performed.  W'e  refer  the  reader  to  the  description  of  Cole’s  algorithm  in  [GR]  as  our  outline 
depends  on  it  and  uses  its  terminology. 
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In  the  previous  subsection,  the  merging  algorithm  implies  a  sorting  algorithm  whose 
complexity  is  0{log  n  loglogu).  To  improve  the  complexity  to  O(logn),  Cole  showed  how 
to  pipeline  the  merging  stops.  Let  T  be  the  complete  binary  tree  that  describes  the  standard 
merge-sort  algorithm;  a  typical  node  v  in  T  merges  two  lists  X  and  1'  when  both  A  and  ) 
are  sorted.  The  novelty  of  Colo's  algorithm  is  that  node  v  starts  processing  the  lists  A'  and 
Y  before  they  are  sorted.  Namely,  at  the  ith  step,  two  lists  A",-  and  Yi  are  merged,  where 
Xi  is  a  sorted  sample  of  X,  and  Yi  is  a  sorted  sample  of  T.  This  merge  can  be  computed 
in  constant  time  if  the  results  of  the  i  —  1st  step  are  known. 

For  the  sake  of  simplicity,  assume  that  n  is  a  power  of  2.  Let  Tv  be  the  subtree  rooted 
at  V  and  /isfy  be  the  list  of  elements  stored  initially  at  the  leaves  of  Ty.  Let  val^f  be  the 
current  list  associated  with  node  v  of  the  tree.  The  sequence  tn/y  will  always  be  a  sorted 
subsequence  of  //sft,,  and  will  double  its  size  in  each  round,  VVe  say  that  a  node  is  complete 
if  and  only  if  atly  =  Let  us  now  describe  what  happens  in  a  typical  internal  node  r 

of  the  tree  during  the  course  of  the  algorithm  (Figure  3). 

In  the  fth  step,  node  v  receives  from  its  left  child  and  right  child  sequences  Xi  and 
respectively.  It  merges  the  two  sequences  to  a  sorted  sequence  valy  with  the  help  of  the  old 
valy.  If  V  is  incomplete,  then  it  sends  every  fourth  element  of  valy  to  its  parent.  During 
the  first  step  after  v  becomes  complete,  every  second  element  is  sent  up,  whereas  in  the 
step  after  that,  every  element  is  sent  up.  The  sequence  Z,  sent  by  v  to  its  parent  is  called 
rcdtice(vo/v). 

The  algorithm  begins  when  all  the  leaves  send  their  value  to  their  parents.  A  node 
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Figure  4:  Merging  .V  and  Y  with  the  help  of  a  good  sampler  Z. 

terminates  two  steps  after  it  became  complete,  and  the  algorithm  ends  when  the  root 
becomes  complete.  It  follows  from  the  way  reduce(ro/„)  is  defined,  that  two  rounds  after 
r  became  complete,  its  parent  becomes  complete.  Therefore,  the  algorithm  takes  O(logn) 
rounds  of  internal  computation  in  the  nodes  of  the  tree.  In  order  to  achieve  the  desired 
complexity,  Cole  showed  how  to  implement  each  such  round  in  constant  time. 

We  now  need  some  notations.  The  rank  of  an  element  x  in  a  sequence  A*,  rank{x^  X}^  is 
the  number  of  elements  in  A"  preceding  x.  The  cross  rank  from  A'  to  y ,  denoted  by  R[X^  Y], 
is  the  function  for  which  fi[A',y'](x)  =  rank{xyY)  for  each  x  €  A*.  A  sorted  sequence  A’  is 
a  good  sampler  of  the  sorted  sequence  Y  if  and  only  if,  between  any  k  +  l  adjacent  elements 
of  {~oo}uA‘ U{oo}  there  are  at  most  2A:+1  elements  of  V.  In  our  case,  assume  that  A*  =  1, 
i.e.,  between  any  two  element  of  Y  there  are  at  most  three  elements  of  A*.  Note  that  it  is 
always  true  that  r€duce{X)  is  a  good  sampler  of  A\ 

The  motivation  behind  these  notations  is  as  follows.  The  cross  rank  7Z[A\y]  (/?[y.  A']) 
enables  us  to  merge  the  sorted  sequences  X  and  Y  in  time  0(1).  In  this  sense,  the  merge  of 
two  sequences  and  their  cross  ranks,  are  equivalent.  Our  description  relies  heavily  on  this 
fact.  It  is  also  easy  to  merge  in  constant  time  two  sorted  sequences  A"  and  Y  with  the  help 
of  a  sequence  Z  which  is  a  good  sampler  of  both  (Figure  4). 

The  basic  property  of  the  algorithm  that  entails  its  correctness  is  well  demonstrated  by 
the  following  invariant  preserved  at  each  step: 
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Main  invariant:  if  each  A,  is  a  good  sampler  of  A,-fj,  and  each  i,  is  a  good  sampler  of 
then  each  Z,  is  a  good  sampler  of  Z,+i. 

In  terms  of  cross  ranks,  the  merge  of  and  V.+i  is  performed  in  three  steps: 

1.  The  four  cross  ranks:  /J[A"nyt+i],  R[Yi,Xi^i],  /i![A\+i,y,)  and  iZ(y,+i,Art)  are  com¬ 
puted  with  the  help  of  /?[A',,  A"i>i]  and  /?[y,y,+i]. 

2.  The  two  cross  ranks:  /?[A,+i,y;+i]  and  ifiy+i,  A'.+i]  are  computed  with  the  help  of 
the  cross  ranks  of  the  previous  step.  (As  stated  before,  this  is  equivalent  to  merging 
A^f+i  ^^d  yj-fj). 

3.  The  cross  rank  /?(Zi,Z,>i]  =  /?[rer/nce(A\  U  Yi)yredttce{Xi^\  U  y+i)]  is  computed 
with  the  help  of  the  previous  calculated  cross  ranks. 

The  three  steps  can  be  implemented  in  0(1)  time  with  0(lA",>i|  +  ly^+il)  processors.  The 
reason  is  that  the  good  sampling  property  insures  us  that  in  each  CPB  graph,  at  least  one 
of  the  paths  is  of  length  at  most  three. 

We  now  explain  how  to  modify  the  above  algorithm  according  to  Theorem  3.1  so  that 
it  computes  a  Hamilton  path  in  an  arbitrary  tournament.  Whenever  a  set  of  vertices  is 
compared,  a  minimal  feedback  set  is  computed  and  the  orientation  of  its  edges  is  flipped. 
Each  stage  of  comparisons  can  be  decomposed  to  edge  disjoint  CPB  graphs  and  in  each  one 
a  minimal  feedback  set  is  computed  by  invoking  Algorithm  MFS.  The  union  of  the  miiiimai 
feedback  sets  computed  in  each  CPB  graph  will  be  a  minimal  feedback  set  in  the  graph 
induced  by  the  set  of  comparisons. 

Theorem  4.3  The  modification  of  Cole's  sort  algorithm  according  to  Theorem  3.1  finds  a 
Hamilton  path  in  an  arbitrary  tournament. 

Proof:  For  arguments  similar  to  those  in  the  proof  of  the  merging  algorithm,  it  follows 
that  the  union  of  the  outputs  of  the  calls  for  algorithm  MFS  in  each  node  of  the  tree  T,  is 
a  minimal  feedback  set  in  the  graph  induced  by  the  comparisons  performed  at  that  node. 

The  theorem  follows  from  the  next  claim  proved  by  induction  on  (fc,  i),  where  k  is  the 
height  of  the  tree  and  t  is  the  current  step. 

•  In  the  ith  step,  in  a  tree  of  height  fc,  the  algorithm  computes  a  minimal  feedback  set. 

•  The  sequence  sent  by  v,  the  root  of  this  tree,  in  the  ith  step  is  consistent  with  the 
previous  sequences  sent  by  v.  Namely,  if  ro/f  contains  the  relation  x  <  y  for  x,  y  € 
listvy  then  in  all  previous  steps,  val^  never  contained  the  relation  y  >  x. 

The  induction  holds  clearly  for  ib  =  1,  namely  the  leaves.  Now  let  T'  be  a  tree  of  height 
fc,  let  V  be  its  root  and  let  t  be  the  current  step.  We  prove  the  first  part  of  the  inductive 
claim  by  showing  that  there  cannot  be  a  cycle  after  the  edges  of  the  minimal  feedback  set 
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in  T\  computed  in  the  tth  step,  are  flipped.  Lei  A'  and  Y  be  the  seqnenres  that  v  received 
from  its  left  and  right  children  respectively.  It  fo!lov;s  from  the  induction  asstiinption  on 
the  height  of  the  tree  that  any  cycle  contained  completely  in  A'  or  in  V  was  handled  by  the 
children  of  v.  On  the  other  hand,  a  cycle  containing  vortices  from  both  A’  and  Y  could  only 
be  generated  by  r.  Assume  there  is  such  a  cycle  and  w.Lo.g.  it  contains  an  edge  between 
vertices  o,6  G  A"  wdiere  «  precedes  b  with  respect  to  A*.  Because  of  the  second  part  of  the 
induction  hypothesis,  this  edge  is  n  6.  However,  such  cycles  are  detected  by  Algorithm 
MFS. 

After  flipping  the  minimal  feedback  set,  the  graph  induced  by  contains  an  instance 
of  Cole’s  merge*sort  algorithm.  Therefore,  the  correctness  of  the  second  claim  follows  from 
the  correctness  of  Cole's  algorithm.  | 

Theorem  4.4  The  time  complexity  of  the  modified  algorithm  is  O(logr?). 

Proof:  A  more  detailed  inspection  of  the  algorithm  shows  that  in  each  CPB  graph  at  least 
one  of  the  paths  is  of  length  at  most  three.  In  Cole’s  algorithm  each  node  in  the  tree  gets 
OdA’t+il-b  lyi^-il)  processor.^,  this  number  is  sufficient  to  calculate  the  calls  for  algorithm 
MFS.  I 
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